NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Taylor-Model Physics-Informed Neural Networks (PINNs) for Ordinary Differential Equations

Nagesh, Chandra Kanth; Sankaranarayanan, Sriram; Kaur, Ramneet; Sahai, Tuhin; Jha, Susmit (May 2025, Proceedings of Machine Learning Research)
Pappas, George; Ravikumar, Pradeep; Seshia, Sanjit A (Ed.)
We study the problem of learning neural network models for Ordinary Differential Equations (ODEs) with parametric uncertainties. Such neural network models capture the solution to the ODE over a given set of parameters, initial conditions, and range of times. Physics-Informed Neural Networks (PINNs) have emerged as a promising approach for learning such models that combine data-driven deep learning with symbolic physics models in a principled manner. However, the accuracy of PINNs degrade when they are used to solve an entire family of initial value problems characterized by varying parameters and initial conditions. In this paper, we combine symbolic differentiation and Taylor series methods to propose a class of higher-order models for capturing the solutions to ODEs. These models combine neural networks and symbolic terms: they use higher order Lie derivatives and a Taylor series expansion obtained symbolically, with the remainder term modeled as a neural network. The key insight is that the remainder term can itself be modeled as a solution to a first-order ODE. We show how the use of these higher order PINNs can improve accuracy using interesting, but challenging ODE benchmarks. We also show that the resulting model can be quite useful for situations such as controlling uncertain physical systems modeled as ODEs.
more » « less
Free, publicly-accessible full text available May 12, 2026
Anticipating Oblivious Opponents in Stochastic Games

Tasdighi_Kalat, Shadi; Sankaranarayanan, Sriram; Trivedi, Ashutosh (October 2024, Springer Proceedings in Advanced Robotics (SPAR))
Amato, Nancy; Driggs-Campbell, Katie; Ekenna, Chinwe; Morales, Marco; O'Kane, Jason (Ed.)
We present an approach for systematically anticipating the actions and policies employed by oblivious environments in concurrent stochastic games, while maximizing a reward function. Our main contribution lies in the synthesis of a finite information state machine (ISM) whose alphabet ranges over the actions of the environment. Each state of the ISM is mapped to a belief state about the policy used by the environment. We introduce a notion of consistency that guarantees that the belief states tracked by the ISM stays within a fixed distance of the precise belief state obtained by knowledge of the full history. We provide methods for checking consistency of an automaton and a synthesis approach which, upon successful termination, yields an ISM. We construct a Markov Decision Process (MDP) that serves as the starting point for computing optimal policies for maximizing a reward function defined over plays. We present an experimental evaluation over benchmark examples including human activity data for tasks such as cataract surgery and furniture assembly, wherein our approach successfully anticipates the policies and actions of the environment in order to maximize the reward.
more » « less
Full Text Available
What is Your Discount Factor?

Tasdighi_Kalat, Shadi; Sankaranarayanan, Sriram; Trivedi, Ashutosh (August 2024, Springer, Cham)
Hillston, Jane; Soudjiani, Sadegh (Ed.)
We study the problem of inferring the discount factor of an agent optimizing a discounted reward objective in a finite state Markov Decision Process (MDP). Discounted reward objectives are common in sequential optimization, reinforcement learning, and algorithmic game theory. The discount factor is an important parameter used in formulating the discounted reward. It captures the “time value” of the reward- i.e., how much reward at hand would equal a promised reward at a future time. Knowing an agent’s discount factor can provide valuable insights into their decision-making, and help predict their preferences in previously unseen environments. However, pinpointing the exact value of the discount factor used by the agent is a challenging problem. Ad-hoc guesses are often incorrect. This paper focuses on the problem of computing the range of possible discount factors for a rational agent given their policy. A naive solution to this problem can be quite expensive. A classic result by Smallwood shows that the interval [0, 1) of possible discount factor can be partitioned into finitely many sub-intervals, such that the optimal policy remains the same for each such sub-interval. Furthermore, optimal policies for neighboring sub-intervals diﬀer for a single state. We show how Smallwood’s result can be exploited to search for discount factor intervals for which a given policy is optimal by reducing it to polynomial root isolation. We extend the result to situations where the policy is suboptimal, but with a value function that is close to optimal. We develop numerical approaches to solve the discount factor elicitation problem and demonstrate the eﬀectiveness of our algorithms through some case studies.
more » « less
Full Text Available
Algorithms for Identifying Flagged and Guarded Linear Systems

https://doi.org/10.1145/3641513.3650140

Berger, Guillaume; Narasimhamurthy, Monal; Sankaranarayanan, Sriram (May 2024, ACM)

Full Text Available
Cone-Based Abstract Interpretation for Nonlinear Positive Invariant Synthesis

https://doi.org/10.1145/3641513.3650127

Berger, Guillaume; Ghanbarpour, Masoumeh; Sankaranarayanan, Sriram (May 2024, ACM)

Full Text Available
Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

https://doi.org/10.1145/3644815.3644989

Tizpaz-Niari, Saeid; Sankaranarayanan, Sriram (April 2024, ACM)

Full Text Available
Temporal Behavior Trees: Robustness and Segmentation

https://doi.org/10.1145/3641513.3650180

Schirmer, Sebastian; Singh, Jasdeep; Jensen, Emily; Dauer, Johann; Finkbeiner, Bernd; Sankaranarayanan, Sriram (May 2024, ACM)

Full Text Available
Template-based piecewise affine regression

https://doi.org/10.1017/cbp.2024.3

Berger, Guillaume O; Sankaranarayanan, Sriram (January 2024, Research Directions: Cyber-Physical Systems)

Abstract We study the problem of fitting a piecewise affine (PWA) function to input–output data. Our algorithm divides the input domain into finitely many regions whose shapes are specified by a user-provided template and such that the input–output data in each region are fit by an affine function within a user-provided error tolerance. We first prove that this problem is NP-hard. Then, we present a top-down algorithmic approach for solving the problem. The algorithm considers subsets of the data points in a systematic manner, trying to fit an affine function for each subset using linear regression. If regression fails on a subset, the algorithm extracts a minimal set of points from the subset (an unsatisfiable core) that is responsible for the failure. The identified core is then used to split the current subset into smaller ones. By combining this top-down scheme with a set-covering algorithm, we derive an overall approach that provides optimal PWA models for a given error tolerance, where optimality refers to minimizing the number of pieces of the PWA model. We demonstrate our approach on three numerical examples that include PWA approximations of a widely used nonlinear insulin–glucose regulation model and a double inverted pendulum with soft contacts.
more » « less
Full Text Available
Counterexample-guided computation of polyhedral Lyapunov functions for piecewise linear systems

https://doi.org/10.1016/j.automatica.2023.111165

Berger, Guillaume O.; Sankaranarayanan, Sriram (September 2023, Automatica)

This paper presents a counterexample-guided iterative algorithm to compute convex, piecewise linear (polyhedral) Lyapunov functions for continuous-time piecewise linear systems. Polyhedral Lyapunov functions provide an alternative to commonly used polynomial Lyapunov functions. Our approach first characterizes intrinsic properties of a polyhedral Lyapunov function including its “eccentricity” and “robustness” to perturbations. We then derive an algorithm that either computes a polyhedral Lyapunov function proving that the system is asymptotically stable, or concludes that no polyhedral Lyapunov function exists whose eccentricity and robustness parameters satisfy some user-provided limits. Significantly, our approach places no a-priori bound on the number of linear pieces that make up the desired polyhedral Lyapunov function. The algorithm alternates between a learning step and a verification step, always maintaining a finite set of witness states. The learning step solves a linear program to compute a candidate Lyapunov function compatible with a finite set of witness states. In the verification step, our approach verifies whether the candidate Lyapunov function is a valid Lyapunov function for the system. If verification fails, we obtain a new witness. We prove a theoretical bound on the maximum number of iterations needed by our algorithm. We demonstrate the applicability of the algorithm on numerical examples.
more » « less
Full Text Available
More Than a Number: A Multi-dimensional Framework For Automatically Assessing Human Teleoperation Skill

https://doi.org/10.1145/3568294.3580167

Jensen, Emily; Hayes, Bradley; Sankaranarayanan, Sriram (March 2023, Intl. Conf. Human-Robot Interaction, HRI 2023 (Late Breaking Report))

Full Text Available

« Prev Next »

Search for: All records